C implementation of LSTM #31

nkoumchatzky · 2017-05-31T16:01:55Z

Pure C implementation of LSTMs with a basic LSTM cell, including pre-multiplication and a CUDA-like input format [(sum(T_i)) X outputSize], where T_i is the number of timesteps of the i-th element of the batch, sorted in decreasing order.

* New "sparse/size" representation
* Full LSTM in C
* VSeqLSTM to wrap this data representation + C implementation
* Augmentation of the VariableLength decorator with this data representation from an array of tensors
* unit tests
* speed tests
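To make the [(sum(T_i)) X outputSize] layout more concrete, here is a minimal sketch in C. It assumes a time-major packing (all step-0 rows first, then the step-1 rows of the sequences that are still active, and so on), which is how cuDNN-style packed input is usually laid out; the PR description does not spell this out. The helper names `packed_total_rows` and `packed_batch_sizes` are hypothetical and are not part of THRNN.

```c
#include <assert.h>

/* Hypothetical helper: number of rows in the packed tensor, i.e. sum(T_i). */
static int packed_total_rows(const int *lengths, int batch)
{
    int total = 0;
    for (int i = 0; i < batch; i++)
        total += lengths[i];
    return total;
}

/* Hypothetical helper: for each timestep t, store in sizes[t] how many
 * sequences are still active (length > t). Because lengths are sorted in
 * decreasing order, the active sequences at step t are always the first
 * sizes[t] elements of the batch. `sizes` must have room for lengths[0]
 * entries. Returns the number of timesteps written. */
static int packed_batch_sizes(const int *lengths, int batch, int *sizes)
{
    int max_len = batch > 0 ? lengths[0] : 0;
    int active = batch;

    /* The layout relies on the decreasing-length precondition. */
    for (int i = 1; i < batch; i++)
        assert(lengths[i - 1] >= lengths[i]);

    for (int t = 0; t < max_len; t++) {
        /* Drop the sequences at the tail of the batch that ended before t. */
        while (active > 0 && lengths[active - 1] <= t)
            active--;
        sizes[t] = active;
    }
    return max_len;
}
```

For lengths {4, 2, 1} this gives 7 packed rows and per-step batch sizes {3, 2, 1, 1}, so the per-step sizes add up to the row count.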

mirandaconrado

The code itself looks good in general. There are lots of small changes that I think should be made either for correctness or clarity.

mirandaconrado · 2017-06-02T22:24:24Z

lib/THRNN/THRNN.h

+typedef int THInteger_t;
+typedef void THRNNState;
+
+#define THRNN_resizeAs_indices(I1, I2)                    \


Put the code in the macro between brackets for safety.
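One common reading of this suggestion is the `do { ... } while (0)` idiom, which makes a multi-statement macro behave as a single statement. The body of `THRNN_resizeAs_indices` is not shown in this hunk, so the sketch below uses a hypothetical `Indices` struct and hypothetical macro names purely to illustrate the failure mode and the fix.

```c
#include <assert.h>

/* Hypothetical stand-in for an index tensor; not the real THRNN type. */
typedef struct {
    int size;
    int resized;
} Indices;

/* Unsafe: expands to two statements, so only the first one is governed
 * by an enclosing `if` that has no braces. */
#define RESIZE_AS_UNSAFE(I1, I2) \
    (I1)->size = (I2)->size;     \
    (I1)->resized = 1

/* Safer: the body is wrapped so the whole macro is one statement. */
#define RESIZE_AS_SAFE(I1, I2)   \
    do {                         \
        (I1)->size = (I2)->size; \
        (I1)->resized = 1;       \
    } while (0)

/* Returns the `resized` flag after conditionally invoking the unsafe macro. */
static int resized_after_unsafe(int cond)
{
    Indices a = {0, 0}, b = {5, 0};
    if (cond)
        RESIZE_AS_UNSAFE(&a, &b); /* second statement runs even if cond == 0 */
    return a.resized;
}

/* Returns the `resized` flag after conditionally invoking the safe macro. */
static int resized_after_safe(int cond)
{
    Indices a = {0, 0}, b = {5, 0};
    if (cond)
        RESIZE_AS_SAFE(&a, &b);
    return a.resized;
}
```

With `cond == 0`, the unsafe variant still sets the flag because its second statement escapes the `if`, while the wrapped variant leaves the struct untouched.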

mirandaconrado · 2017-06-02T22:31:11Z

lib/THRNN/generic/LSTM.c

+#include <omp.h>
+#endif
+
+/* Set tensor->size[0] MACRO */


This comment is inaccurate: the macro sets the size of any dimension, not only the first position.

mirandaconrado · 2017-06-02T22:31:22Z

lib/THRNN/generic/LSTM.c

+#define THRNN_LSTM_SET_SIZE(t, dim, newSize) ( t->size[dim] = newSize )
+#endif
+
+/* Set tensor->size[0] MACRO */


Same here: not only the first position, and this macro sets the stride, not the size.

mirandaconrado · 2017-06-02T22:33:22Z

lib/THRNN/generic/LSTM.c

+// and a LongStorage of default sizes for new individual buffers
+struct THRNN_(buffer)* THRNN_(create_buffer)(THTensor* buf, THLongStorage* default_buffer_sizes)
+{
+   THTensor** arr;


mirandaconrado · 2017-06-02T22:56:42Z

lib/THRNN/generic/LSTM.c

+   buffer->sizes = (int*)realloc(buffer->sizes, buffer->len * sizeof(int));
+   buffer->sizes[buffer->len-1] = size;
+   buffer->array = (THTensor**)realloc(buffer->array, buffer->len * sizeof(THTensor**));
+   THTensor* new_guy = THTensor_(new)();


mirandaconrado · 2017-06-04T17:46:21Z

VariableLength.lua

+      self.cgradInput = self.cgradInput:type(first_input:type())
+      self._input = self._input or first_input.new()
+      self._input = self._input:type(first_input:type())
+      self._input = self._input or {}


self._input will always be defined as a tensor by the time the last line runs, so the `or {}` fallback never applies.

mirandaconrado · 2017-06-04T17:51:53Z

test/bigtest.lua

-   local input = torch.Tensor(seqlen, batchsize, inputsize)
-   local gradOutput = torch.Tensor(seqlen, batchsize, outputsize)
+   local input = torch.Tensor(seqlen, batchsize, inputsize):uniform()
+   local gradOutput = torch.Tensor(seqlen, batchsize, outputsize):uniform()

   local t = torch.Timer()


t was replaced by tic/toc.

mirandaconrado · 2017-06-04T17:52:04Z

test/bigtest.lua

      -- main test
+      collectgarbage()
      t:reset()


mirandaconrado · 2017-06-04T17:53:37Z

test/test.lua

@@ -2828,6 +2828,110 @@ function rnntest.SeqLSTM_Lua_vs_C()
   end
 end

+function rnntest.SeqLSTM_vs_VSeqLSTM()


This benchmark is duplicated above, except that one uses double and the other float.

mirandaconrado · 2017-06-04T17:54:03Z

test/test.lua

      if not testLM then
         for i=1,batchSize do
-            input[i] = torch.randn(torch.random(1,maxLength), hiddenSize)
+            input[i] = torch.randn(i,hiddenSize)--torch.random(1,maxLength), hiddenSize)


Remove comment.

nkoumchatzky force-pushed the nkoumchatzky/clstm branch 2 times, most recently from c4ff053 to d225b51 Compare June 3, 2017 00:59

C implementation of LSTM -- REVIEWABLE

8f8677a

* New "sparse/size" representation
* Full LSTM in C
* VSeqLSTM to wrap this data representation + C implementation
* Augmentation of the VariableLength decorator with this data representation from an array of tensors
* unit tests
* speed tests

nkoumchatzky force-pushed the nkoumchatzky/clstm branch from d225b51 to 8f8677a Compare June 3, 2017 22:52

nkoumchatzky changed the title ~~[WIP] - C implementation of LSTM~~ C implementation of LSTM Jun 4, 2017

mirandaconrado requested changes Jun 4, 2017
